Author Identification from Citations

نویسندگان

  • Joseph K. Bradley
  • Patrick Gage
چکیده

Machine Learning techniques can be applied to citation data from a network of papers to predict the author of a paper that is currently outside of the network. Using a series of models we have found that we can increase the accuracy from past experiments with citation data, by considering the citations as a network. This allows us to predict with confidence the author of a blind paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evidence-Based Information Extraction for High Accuracy Citation and Author Name Identification

Citations play an essential role in navigating academic literature and following chains of evidence in research. With the growing availability of large digital archives of scientific papers, the automated extraction and analysis of citations is becoming increasingly relevant. However, existing approaches to citation extraction still fall short of the high accuracy required to build more sophist...

متن کامل

Performance Behavior Patterns in Author-Level Metrics: A Disciplinary Comparison of Google Scholar Citations, ResearchGate, and ImpactStory

The main goal of this work is to verify the existence of diverse behavior patterns in academic production and impact, both among members of the same scientific community (inter-author variability) and for a single author (intra-author variability), as well as to find out whether this fact affects the correlation among author-level metrics (AutLMs) in disciplinary studies. To do this, two sample...

متن کامل

Distributed Open Access Reference Citations Service

We report on the concept, progress and status of the new project Distributed Open Access Reference Citations Services, DOARC . The emphasis is to exploit especially the Open Access documents on Institutional Repositories: analyzing their reference lists for citations, analyzing the full text for research field specific two-word shingles, and use this for powerful author and user tools, one of w...

متن کامل

Towards the Automatic Identification of the Nature of Citations

The reasons why an author cites other publications are varied: an author can cite previous works to gain assistance of some sort in the form of background information, ideas, methods, or to review, critique or refute previous works. The problem is that the best possible way to retrieve the nature of citations is very time consuming: one should read article by article to assign a particular char...

متن کامل

Author gender identification from text using Bayesian Random Forest

Nowadays high usage of users from virtual environments and their connection via social networks like Facebook, Instagram, and Twitter shows the necessity of finding out shared subjects in this environment more than before. There are several applications that benefit from reliable methods for inferring age and gender of users in social media. Such applications exist across a wide area of fields,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006